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1 .  Introduction 


Let  A  and  B  be  matrices  of  sizes  m  by  t  and  t  by  n,  respectively, 
with  elements  in  a  field  F.  Let  x1 ,  . . . ,  x^  denote  t  independent  inde- 
terminates  over  F  and  define 

X  =  diag(x.,,  xt].  (1.1) 

I'1  * 

Then  ; 


AXB  *  Y 


(1.2) 


is  a  matrix  of  size  m  by  n  such  that  every  element  of  Y  is  a  linear 

form  in  x^  ...,  x^  over  F.  In  the  present  paper  w  investigate  the 

converse  proposition.  Thus  let  4 

Y  =  Y(x1,  xt)  (1.3) 

be  a  matrix  of  size  m  by  n  such  that  every  element  of  Y  is  a  linear 

form  in  x,,  ...,  x^  over  F.  Then  under  what  conditions  are  we  assured 
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of  the  existence  of  a  factorization  of  Y  of  the  form  (1.2)?  Our  con¬ 
ditions  turn  out  to  be  very  natural  ones  and  they  are  easily  described 
in  terms  of  compound  matrices.  We  now  state  in  entirely  elementary 
terms  a  special  case  of  one  of  our  conclusions. 

Theorem  1.1.  tet  Y  be  a  matrix  of  order  n  £  3  such  that  every 
element  of  Y  is  a  linear  form  in  x1 ,  . . . ,  xn  over  F  and  let 

X  =  diagfx^  ...»  xn).  (1.4) 

Suppose  that 

det(Y)  =  cx1  • • •  xn,  (1.5) 

where  c  ^  o  and  c  €  F,  gad  suppose  further  that  every  element  of  Y-1  is 
a  linear  form  in  x'1 ,  ...,  x^1  over  F.  Then  there  exist  matrices  A  and 
B  of  order  n  with  elements  in  F  such  that 

AXB  =  Y.  (1.6) 

Our  work  has  been  strongly  motivated  by  the  much  earlier  invest¬ 
igations  of  Kantor  [2],  Frobenius  [1],  and  Schur  [5).  These  authors 
study  a  related  problem  but  with  X  a  matrix  of  size  m  by  n  and  such  that 
the  elements  of  X  are  mn  independent  variables  over  the  complex  field. 

A  more  recent  account  of  this  theory  is  available  in  [3]. 

Finally,  we  remark  that  the  matrix  equation  (1.2)  is  of  considerable 
combinatorial  importance  in  its  own  right.  For  example,  if  A  and  B  are 
(0, 1 ) -matrices,  then  (1.2)  admits  of  a  simple  set  theoretic  interpretation. 
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The  special  case 

AXAT  =  Y,  (1.7) 

T 

where  A  is  the  transpose  of  A,  has  been  investigated  briefly  in  [4], 

But  we  do  not  pursue  the  combinatorial  aspects  of  this  subject  here. 

2.  The  Main  Theorems 

Throughout  ths  discussion  we  let  F  denote  an  arbitrary  field  and 
we  let  x1 ,  ...»  denote  t  independent  indetenninates  over  F.  We 

define 

X  =  diag[x1 ,  ...,  x^).  (2.1 ) 

We  then  form  all  of  the  products  of  x^  . . . ,  x^  taken  r  at  a  time  and 
we  always  denote  these  products  written  for  convenience  in  the 
"lexicographic"  ordering  by 

y,>  ...»  yu  («  -  (*))•  (2.2) 

Now  let 

Y  =  Y(x,,  ...,  xt)  (2.3) 

denote  a  matrix  of  size  m  by  n  such  that  every  element  of  Y  is  a  linear 
form  in  x)(  ...,  x+  over  F.  We  further  assume  that 

'  S  r  $  min(m,n)  (2.U) 

and  we  let  Cr(,Y)  denote  the  rth  compound  of  the  matrix  Y.  Thus  Cr(Y) 

is  of  size  (m)  by  (n)  and  the  elements  of  C  (Y)  are  the  determinants 
r  r  r 


a 


h 


of  the  various  submatrices  of  order  r  of  Y  displayed  within  C  (Y)  in 
the  "lexicographic"  ordering.  We  note  that  the  preceding  terminology 
implies 

Cr(X)  =  diagfy^,  yu].  (2.5) 

We  are  now  prepared  to  state  one  of  our  main  conclusions. 

Theorem  2. 1 .  Let  Y  denote  a  matrix  of  size  m  by  n  such  that 
every  element  of  Y  is  a  linear  form  in  x1 ,  . . . ,  x^.  over  F  and  let 
y . .  y  denote  the  products  of*.,  . . . ,  x,  taken  r  at  a  time.  We 

s  U  I  X  1  — — 

assume  that 

2  g  r  g  rank  (Y)  -  2  (2.6) 

and  that  every  element  of  Cr(Y)  is  a  linear  form  in  y^ ,  ...,  yy  over  F. 

Then  there  exist  matrices  A  and  B  of  sites  m  by  t  and  t  by  n,  respect¬ 
ively,  with  elements  in  F  such  that 

AXB  -  Y.  (2.7) 

We  begin  with  a  simple  lemma  concerning  the  matrix  Y  of  (2.3). 

Lemma  2.2.  Let 

Yj_  =  Y(0,  ...,  0,  x1,  0,  ...,  C)  (2.8) 

and  suppose  that 

rank  (^)<  1  (i  =  1,  ...,  t).  (2.9) 

Then  there  exist  matrices  A  and  B  of  sizes  m  by  t  and  t  by  n,  respectively. 
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with  elements  in  F  such  that 


AXB  ■  Y* 


(2.10) 


Proof.  The  assertion  rank  (Y.  )  g  1  implies  that  we  may  write 


Y  =  a  x  8  . 
i  i  i  i’ 


(2.11 ) 


where 


i  =  :  ,  \  •  (bllf  ...»  bin) 


(2.12) 


are  vectors  with  components  in  F.  Here  if  rank  (Y^)  *  1  we  have 

cr  4  0  and  8^^  0.  But  if  rank  (Y^,)  =  0  we  have  0^  =  0  and  B^  arbitrary 


or  B.  =  0  and  ql  arbitrary.  Thus 


y-  Y1  +  •••  +  Yt  “Viei  ♦  —  +  Vt0t 


(ot-j  >  ...»  ]  X  . 


(2.13) 


and  our  conclusion  follows. 


Notice  further  that  if 


rank  (Y± )  *  1  (i  -  1,  .. 


(2.14) 


and  if 


A'xb'  »  Y, 


(2.15) 
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then  there  exists  a  nonsingular  diagonal  matrix  D  with  elements  in  F 
such  that 

A'  =  AD"1,  B'  =  DB.  (2.16) 

It  is  now  clear  that  the  following  lemma  is  actually  a  reformu¬ 
lation  of  Theorem  2.1. 

Lemma  2.3.  The  matrix  Y  of  Theorem  2.1  satisfies 

rank  (Y^)  SJ  1  (i  »  1,  ...»  t).  (2.17) 

Proof.  We  remark  at  the  outset  that  the  lemma  is  elementary  for 
r  =  2.  In  this  case  rank  (Y^  1  because  otherwise  we  contradict  the 

assumption  that  every  element  of  C2(y)  is  a  linear  form  in  y,,  ...,  yu 
over  F. 

Hence  we  take  r  g  3.  Let  us  suppose  that 

rank  (Y^)  «  p  >  1  (2.18) 

for  some  i  =  1,  ...,  t.  Then  there  exist  nonsingular  matrices  P  and  Q 
of  orders  m  and  n,  respectively,  with  elements  in  F  such  that 

PY.Q  =  x1I©0.  (2.19) 

In  (2.19)  the  matrix  I  is  the  identity  matrix  of  order  p,  0  is  a  zero 
matrix,  and  the  sum  is  direct.  The  elements  -f  the  matrix 

FYQ  »  Z  (2.20) 

are  linear  forms  in  x^  ...»  x^  over  F.  It  follows  from  (2.13)  and 
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(2.19)  that  the  structure  of  Z  is  such  that  the  indeterminate  x^  appears 
in  positions  (1,1),  ...,  (p,p),  and  in  no  other  positions  in  Z.  The 
familiar  multiplicative  property  of  the  compound  matrix  implies 

c  (P)C  (Y)C  (q)  =  C  (Z),  (2.21) 

r  r  r  r 

and  by  our  assumption  on  Cr(Y)  ve  may  conclude  that  each  of  the  elements 
of  Cr(z)  is  also  a  linear  form  in  ...,  yy  over  F. 

We  designate  by  F^  the  quotient  field  of  the  polynomial  ring 

F[x1 ,  •••>  Xj_^,  x^].  (2.22) 

Ii*  this  notation  the  elements  of  Z  and  C  (Z)  are  scalars  or  polynomials 

r 

in  of  degree  1  over  F^.  In  vhat  follows  we  apply  certain  elementary 
row  and  column  operations  to  Z  with  respect  to  the  field  F^.  This  means 
that  we  determine  certain  nonsingular  matrices  p'  and  Q*  of  orders  m  and 
n,  respectively,  with  elements  in  F^  such  that 

p'zq'  «  Z'.  (2.23) 


Then  once  again  we  have 

Cr(P/)Cr(Z)Cr(Q/)  *  Cr(Z').  (2.2U) 

Thus  we  see  that  the  elements  of  z'  and  C  (Zf )  are  scalars  or  poly- 

r 

noraials  in  x  of  degree  1  over  F, . 

i  i 

We  now  write  Z  in  the  form 


Z  = 


(2.25) 
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where  W  is  of  order  p.  We  note  that  det(w)  is  a  polynomial  in  xi  of 
degree  p  >  1  over  F^.  Let  the  submatrix  of  Z  in  the  lower  right  corner 
of  Z  of  size  m  -  p  by  n  -  p  be  of  rank  p.  Then  we  may  apply  elementary 
row  and  column  operations  with  respect  to  to  the  last  m  -  p  rows  and 
the  last  n  -  p  columns  of  Z  and  replace  Z  by 

(2.26) 


In  (2.26)  the  matrix  I  is  the  identity  matrix  of  order  p  and  the  0's 
denote  zero  matrices.  We  assert  that 

p  +  0  &  r  -  1  (2.27) 

because  p  +  p  2  r  contradicts  the  fact  that  all  of  the  elements  of 
Cr(z')  are  scalars  or  polynomials  in  of  degree  1  over  F^.  Let  the 
submatrix  w'  of  z'  be  of  rank  p/.  We  have  rank  {z‘ )  =  rank  (Y)  and 
hence  we  may  conclude  that 

P  +  0  +  0*  gJ  rank  (Y).  (2.28) 

It  now  follows  from  (2.6),  (2.27),  and  (2.28)  that 

p'  >  3.  (2.29) 

We  permute  the  last  m  -  (p  +  p )  rows  and  the  first  p  columns  of 
z'  so  that  the  submatrix  of  order  2  in  the  lower  left  corner  of  w'  has 
a  nonzero  determinant.  We  then  further  permute  the  first  p  rows  of  z' 
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so  that  the  p  polynomials  in  xi  of  degree  1  over  F^  again  occupy  the 
main  diagonal  positions  of  W.  By  elementary  row  operations  with  respect 
to  Fi  we  may  replace  the  matrix  of  order  2  in  the  lower  left  corner  of 
w'  by  the  identity  matrix.  We  then  apply  further  elementary  row  oper¬ 
ations  with  respect  to  F^  and  make  all  elements  in  columns  1  and  2  of 
z'  equal  to  0,  apart  from  the  elements  in  the  (1,1),  (2,2),  (m-1,l),  (m,2) 
positions,  and  these  elements  are  equal  to  x^,  x^,  1,  1,  respectively. 

We  delete  rows  1,  2,  m-1,  m  and  columns  1,  2  from  Z'  and  call  the 
resulting  submatrix  Z.  Then  we  have 


(2.30) 


The  matrix  2  is  of  size  m  -  4  by  n  -  2.  Let  Z  be  of  rank  p.  We  have 
rank  (Z#)  =  rank  (Y)  and  hence 

p"  +  1  £  rank  (Y).  (2.31 ) 

We  assert  that 

cr_2(z)  fi  o. 


(2.32) 


10 


Suppose  on  the  contrary  that  Cy  ^ (Z )  =  C.  Then 

P  <  r  -  3-  (2.33) 

But  then  by  (2.6),  (2.31),  and  (2.33)  we  have 


rank  (Y)<j(T+lt-<r+1£  rank  ( Y )  -  1 ,  (2. 3^ ) 

and  this  is  a  contradiction.  Hence  C  _(Z)  /  0.  This  means  that  Z 

r-c 

has  a  submatrix  of  order  r-  2  with  a  nonzero  determinant.  But  this  sub- 

matrix  of  Z  in  conjunction  with  the  first  two  rows  and  columns  of  Z1 

yields  a  submatrix  of  of  order  r  whose  determinant  is  a  polynomial 

in  xi  of  degree  2  or  higher  over  F^.  This  contradicts  the  fact  that 

the  elements  of  C  (Z/)  are  scalars  or  polynomials  in  x,  of  degree  1  over 
r  1 

F .  Hence  we  have 

rank  (Yj)  $1  (i  -  1,  ...,  t).  (2.35) 


This  proves  Lemma  2.3  and  Theorem  2.1. 

The  range  of  r  in  the  preceding  theorem  cannot  in  general  be  extended 
to  r  =  rank  (Y)  -  1.  We  define 


(2.36) 


where  the  0's  denote  zero  matrices.  Then  we  have  t  ■  n  ♦  1  and  if 


n  >  ^  we  have 


1 1 


Y_1  =  diag  r 

L  X1 


(2.37) 


Hence  for  r  =  n  -  I  w  see  that  every  element  of  Cy(Y)  is  a  linear  form 
in  . ..,  over  F.  But  clearly  rank  (Yr+1 )  =  2. 

The  preceding  theorem,  however,  is  valid  for  r  *  rank  (Y)  -  1  under 
the  added  assumption  t  =  rank  (Y).  This  theorem  is  actually  a  general¬ 
ization  of  Theorem  1.1  described  in  Section  1. 

Theorem  2.k.  Let  Y  denote  a  matrix  of  size  i  n  such  that  every 
element  of  Y  is  a  linear  form  in  x^ ,  . . . ,  x^  over  F  and  let  y1 ,  . . . ,  yy 
denote  the  products  of  x1 ,  . . . ,  x^  taken  r  at  a  time.  We  assume  that 

2  £  r  »  rank  (Y)  -  1,  (2.38) 

t  =  rank  (Y),  (2.39) 

and  that  every  element  of  Cr(Y)  is  a  linear  form  in  y,,  . . .,  yu  over  F. 

Then  there  exist  matrices  A  and  B  of  sizes  ntjt  end  t  n,  respectively, 
with  elements  in  F  such  that 


AXB  -  Y. 


(2.U0) 
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Lemma  2.5.  Let  Y  be  a  nonsingular  matrix  of  order  t  ^  3  such  that 
every  element  of  Y  is  a  linear  form  in  x1 ,  . . . ,  x^  over  F.  l£t  r  =  t  -  1 
and  suppose  that  every  element  of  Cr(Y)  is  a  linear  form  in  y^  ...,  yy 
over  F.  Then 

det(Y)  =  cx1  •••  x^,  (2.41) 

where  c  f  0  and  c  €  F. 

Proof.  Let 

rank  (Y^)  =  p.  (2.42) 

We  apply  the  same  elementary  row  and  column  operations  as  in  Lemma  2.3. 

Thus  we  /mow  that  there  exist  nonsingular  matrices  P  and  Q  of  order  t 
with  elements  in  F  such  that 

PYQ  ■  Z.  (2.43) 

The  elements  of  Z  are  linear  forms  in  x^  ...,  xt  over  F.  But  the 
structure  of  Z  is  such  that  x ^  appears  in  positions  (1,0,  . ..,  (p,p), 
and  in  no  other  positions  in  Z.  We  know  that  every  element  of  Cy(Z)  is 
a  linear  form  in  y^  . . . ,  y^  over  F.  Hence  t  i  3  implies  that  we  cannot 
have  x^  in  the  (t,t)  position  of  Z.  Thus  does  not  occur  in  the  last 
column  of  Z.  An  evaluation  of  det(Z)  by  this  column  implies  that  no 
term  of  det(Z)  contains  x^^  to  a  power  higher  than  the  first.  Thus  no 
term  of  det(Y)  contains  x^  to  a  power  higher  than  the  first,  and  this 
is  valid  for  each  i  =  1,  ...,  t.  Hence  by  the  structure  of  Y  we  conclude 
that  det(Y)  is  a  nonzero  scalar  multiple  of  x,  •••  x^.. 
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The  following  lemma  completes  the  proof  of  Theorem  2.4. 

Lemma  2.6.  The  matrix  Y  of  Theorem  2.4  satisfies 

rank  (Yj)  <1  (i  -  1,  ...,  t).  (2.44) 

Proof.  We  assume  that 

rank  (Y±)  -  p  >  1  (2.45) 

for  some  i  »  1,  t.  Once  again  there  exist  nonsingular  matrices 

P  and  Q  of  orders  m  and  n,  respectively,  with  elements  in  F  such  that 

FYQ  '  Z.  (2.46) 

The  elements  of  Z  are  linear  forms  in  x^,  ...,  xt  over  F.  But  the 
structure  of  Z  is  such  that  the  indeterminate  xi  appears  in  positions 
(1,1),  . ..,  (p,p),  and  in  no  other  positions  in  Z.  Furthermore,  every 
element  of  Cr(z)  is  a  linear  form  in  y^  ...,  y^  over  F. 

The  suhoatrix  W  of  order  p  in  the  upper  left  corner  of  Z  is  non¬ 
singular  because  its  determinant  is  a  polynomial  in  xi  of  degree  p  over 
F^.  We  have 

t  =  rank  (Y)  =  rank  (z)  £  3  (2.47) 

and  hence  Z  contains  a  nonsingular  submatrix  Z1  of  order  t  with  W  in 
its  upper  left  corner.  We  now  write 

zV1  =>  I.  (2.48) 

The  elements  of  z'  are  of  the  form  ax^  +  b,  where  a,b  €  F^.  Moreover, 
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the  polynomials  in  x^  of  degree  1  over  F^  appear  in  positions 
(1,1),  ...,  (p,p),  and  in  no  other  positions  in  z' .  Every  element  of 
C  (Z')  is  a  linear  form  in  y^  ...,  y^  over  F.  Hence  by  Lemma  2.5 
every  element  of  Z/_1  is  of  the  form  cx^1  +  d,  where  c,d  €  Fi<  We  now 
multiply  row  1  of  Z1  by  column  j  of  Z/_1.  This  product  is  0  or  1 . 

Hence  the  element  in  the  (1,j)  position  of  Z/_1  is  of  the  form  cx"1, 
where  c  €  F^.  Similarly,  each  of  the  elements  in  the  first  p  rows  of 
Z'  1  is  of  this  form.  Hence 

det(z'-1 )  =  x^pf (x"1 ),  (2.49) 

where  f(x”1 )  is  a  nonzero  polynomial  in  x^1  over  F^.  But  by  Lemma  2.5 
we  have 

det(Z'_1)  -  ex‘\  (2.50) 

where  e  /  0  and  e  €  F^  This  contradicts  p  >  1,  Hence  p  =  1  and  the 


emma  is  established. 
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